arXiv: 1504.01123v3 [cs.IT] 29 Aug 2015 


Coded Caching with Heterogenous Cache Sizes 

Sinong Wang, Wenxin Li, Xiaohua Tian, Hui Liu 
Department of Electronic Engineering, Shanghai Jiao Tong University 
{snwang,xtian,huiliu} @sjtu. edu.cn 


Abstract —We investigate the coded caching scheme under the 
heterogenous cache sizes. 

I. Introduction 

Coded caching is a novel mechanism to relieve wireless 
congestion during peak-traffic times for content distribution Q 
where the temporal variability of wireless traffics is 
utilized. With coded caching, popular contents are partially 
prefetched at users’ local cache during the placement phase, 
i.e., off-peak traffic times, and the rest of the contents are 
delivered using coded multicasting during delivery phase, i.e., 
peak traffic times upon request. Compared with the traditional 
caching scheme that adopts the orthogonal unicasting trans¬ 
mission and the caching gain is straightforwardly dependent 
on the user’s cache size 0- coded caching provides 
coding opportunities among different requests during the de¬ 
livery phase, which further exploits cache resources by jointly 
optimizing the placement and delivery phases. Efforts have 
been made to reveal the fundamental limits of coded caching in 
an information-theoretical perspective that the coded caching 
scheme in the bottleneck network can achieve a global cache 
gain to reduce the delivery-phase traffic volume and only 
exhibits the constant gap to the information-theoretical lower 
bound. 

The seminal works attract much attentions in the community 
and encourages further investigations on the coded caching 
scheme ED- (B); however, existing works have the shared 
assumption, that is, users have the identical cache size, which 
is extremely difficult to satisfy in practice. In this paper, 
we present a comprehensive study on coded caching with 
heterogeneous cache sizes at the user end. 

Several non-trivial challenges need to be addressed before 
we get the insight of coded caching scheme with heteroge¬ 
neous cache size. First, what is the fundamental bound in an 
information theoretical perspective under heterogeneous cache 
size? Second, how to implement this lower bound by designing 
the placement and delivery schemes? In particular, could the 
schemes designed for the setting of homogeneous cache size 
be applied to the heterogeneous cache size scenario? If not, 
what is the root cause and how could we design algorithms 
dedicated for the heterogeneous cache size scenario? Third, 
how the heterogeneous cache size influences the wireless 
traffic volume during the delivery phase? Is it possible to 
obtain an analytical form trade-off between cache size and 
traffic volume as in the homogeneous setting ED- E9 ? 

This paper tries to shed light on how to resolve these 
challenges, where some interesting results have been derived. 


In the traditional coded caching scheme |7j, it is assumed 
that each user has an identical memory space and the random 
caching procedure in the placement phase will produce the 
content segments of approximate equal size, and form a 
maximal clique of different segments that can be fully utilized 
to create the coding opportunities in the delivery phase. How¬ 
ever, heterogeneous cache size incurs that the size of content 
segments in the delivery phase is also heterogeneous, which 
causes problems for coding. A straightforward solution is to 
perform padding to smaller-sized segments so that all segments 
can be aligned for coding. Apparently, such an approach can 
cause larger-sized segments to miss coding opportunities in 
the delivery phase thus increase the traffic volume. 

Through analyzing the internal structure of such problem, 
we prove above zero-padding solution is the optimal coded 
delivery scheme under the random caching procedure, and 
such miss coding phenomenon is impossible to counteract in 
this setting. Then we drive the information-theoretical lower 
bound in this case, an interesting finding from our investigation 
is revealed that: although introducing potential miss of coding 
opportunities, coded caching scheme adopting padding under 
heterogeneous cache size still presents a constant gap to the 
optimal scheme, and the constant gap is less than 12, which 
is identical to the homogeneous case. The main reason is that 
the miss of coding opportunities is an inherent limitation of 
bottleneck network under the heterogenous cache sizes, which 
appears not only in the coded caching scheme, but also in the 
fundamental bound. 

To further investigate the fundamental limits of heteroge¬ 
nous cache size, we introduce the concept of probabilistic 
cache set and characterize such memory-traffic volume trade¬ 
off and order optimality via the numerical statistics of the 
user cache size distribution. We analytically show that the gap 
of traffic volume produced by coded caching scheme to the 
lower bound will decrease when the deviation of all users’ 
cache sizes increases. This result implies that the fundamental 
bound has a more increasing speed of traffic volume and 
the coded caching scheme will gradually degenerate to the 
uncoded version as the difference among users’ cache sizes 
increases. 

Besides that, we find that the coded caching scheme will 
present the characteristic of the grouping coded delivery 
(GCD), where users are divided into groups based on their 
cache sizes and coded caching are performed on each group 
separately. In particular, when the deviation of users’s cache 
sizes increases, the GCD can be approximately order optimal 
to the lower bound. This finding indicates that coded caching 



scheme could be implemented in GCD under real systems, 
because GCD can gain significant decrease in computational 
complexity at the cost of very limited performance. 

The remainder of the paper is organized as follows. We 
describe the service model and problem setting in Sectio n [H] 
and provide some preliminaries and motivations in Section |III| 
Section [TV] presents our main results under heterogenous cache 
sizes, including our modified coded caching scheme, traffic 
volume-memory trade-off and order optimality analysis. Sec¬ 
tion [V] further investigates this problem under the probabilistic 
cache set. Numerical analysis are presented in Section [VT] 


Section VI concludes this work and exhibits some interesting 
extensions. The proof of our main results and details are 
provided in the technical report. 


II. Model and Problem Statement 


cache. The users divide their cache space into N identical 
parts and randomly choose which bits to cache using a 
random number generator. By uploading the seed value of each 
user’s random number generator, the server can reconstruct the 
caching contents of each user [7). 

In the delivery phase, each user first sends its request d k 
( dk is the index of content W dh , dk £ A f,k £ 1C). These 
requests are independently and identically distributed (i.i.d) 
across the contents and users. Then the server collects all 
users’ requests (d±, d 2 , ■ ■ ■ , dx) and proceeds by transmitting 
a signal X( dl>d2j ... ^ K ) of size R^ 1,d2 ' ,d,K )F bits over the 
shared link. This metric are referred to as the load or the traffic 
volume of the shared link under scheme 3- Using the caching 
contents and signal received over the shared link, each user 
can reconstruct its requested contents. 


In this paper, we consider a set of users connecting to a 
content server through a shared wireless link that is similar as 
the setup in j7|. 

A. Sendee Model 

We consider a network consisted of a content server 
connected to K users through a shared, error-free link, as 
illustrated in Fig. |T| The error-free link can be achieved with 
error correction scheme or reliable transmission scheme in the 
upper layer. The user set is denoted by 1C = {1 ,,K}. The 
content server has accessed to a database of N(K < N) 
uniform distributed contents W±, W 2 , ■ ■., Wn with each of 
size F bitsQ The same-size-content assumption is for the 
theoretical convenience, which however does not hinder the 
practicability of the coded caching operations in the real world, 
because the main body of content objects can be tailored as 
the same size for coded caching based distribution and the 
rest in a small quantity can be distributed in the traditional 
way. The content index set is denoted by J\f = {1,..., AT}. 
Each user k has an isolated cache space Z k of size M k F bits 
for some real number M k £ [0, iV]. All users’ cache sizes 
constitutes a cache set Ad = {Mi,..., Mx^\ Without loss 
of generality, we assume that the cache set Ad is a ordered 
set, i.e.. Mi < M 2 < ••• Mx■ The system operates in two 
phases: a placement phase and a delivery phase. 
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1 The size can also be packet-based. For simplicity, we use tile bits as the 
metric in the following. 

- Here we use the cache set to denote the set consisted of users’ cache sizes 
instead of each user’s local cache Z k . 



K users 


B. Problem Statement 


Then based on above setting, we present the basic defini¬ 
tions in our problem. 

A memory-traffic volume pair (Ad, R^ 1 ’ ’ d *7) is achiev¬ 
able for scheme j? under requests (di,--- ,dx) if every 
user k is able to reconstruct its requested content W dk with 
error probability P e —► 0 and produce the traffic volume 
• • • ,dx)F bits under the cache set Ad. A memory- 
traffic volume pair (Ad, Rf) is achievable for scheme 3 if this 
pair is achievable for every possible requests (di, • • ■ , dx) in 
the delivery phase. Defined by 

Definition 1: (Achievable scheme) 


R 


A 

3 - 


max 

(di,--- ,d K )eR K 


^>(di Ak) 
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(1) 


the worst case normalized traffic volume for scheme 3- 
We use the R* to represent the smallest traffic volume such 
that (Ad,i?*) is achievable. Defined by 
Definition 2: (Optimal scheme) 


(M,R*)± inf {(Ml, R S ),VM, 3} (2) 


the infimum of all achievable (Ad, /(j). 

Clearly, R$ is function of cache set Ad and number of 
users K and number of contents N. To emphasize this 
dependency, we rewrite above traffic volume as f?j(Ad, N , K) 
and IP (Ai. N. K ). The aim of this paper is to find a scheme 
$ such that Ii^(Mi. N. K) guarantees the order optimality, 
'^defined as 

Definition 3: (Order optimality) The scheme a is order 
optimal if only if 

Rs(M,N,K) 

R*(M,N,K)~ ’ () 


C is a constant independent of the system parameters Ad, N 
and K. 

We can see that the order optimality can be guaranteed only 
if the traffic volume produced by scheme 3 has the constant 
Jgwp to the optimum. For the simplicity of the following 
illustration, we refer Ad 7 as the homogeneous cache set for the 
heterogenous cache set Ad, both of which have the identical 
aggregate cache size. Defined by 



















































Definition 4: (Homogeneous cache set) The homogeneous 
cache set of A4 is defined as At 1 = {M, M,... ,M}, where 
M = E k[Mk\ the average size of all users’ local caches. 

III. Preliminary and Motivation 
A. Related Works 

The wireless traffics presents a high temporal variability in 
network traffic volume and the caching is a promising way 
to balance such traffic load. One line of studies O' 0 were 
centered on optimizing the system throughput of the caching 
network without considering the coding in the transmission, 
where the performance is limited by the size of each user. The 
second line of researchers, recently, investigate such problem 
in the information-theoretical perspective and propose a novel 
technique named as coded cache. This technique derives from 
the index coding problem, which is to determine the minimum 
code length to satisfy multiple given users requests under 
given side information, i.e., cache sate, in the broadcasting 
channel 1161- m- The coded caching scheme is a two-phases 
approach including the side information choice (placement 
phase) and index coding delivery (delivery phase). Through 
jointly optimizing these two phases, the code length under any 
user requests can be minimized and the system performance, 
especially in the aspect of traffic volume, can be improved 
significantly compared to the uncoded counterpart. 

Due to the significant gain in reducing traffic volume and 
the straightforward analytical form, this result has attracted 
much attention in the community. There are three sub-lines 


of this feild. The first line of work [15] mainly investigates 
more tight information-theoretical lower bound. The second 
line of studies are aim to apply the coded caching technique 
to other network structures such as hierarchical caching system 
in 0 and heterogenous network with multi-level cache access 
in The third line of works focuses on the more practical 
scenarios. In HD- 03), the authors assume the content has 
the non-uniformly populated contents and propose the specific 
cache allocation strategy to minimize the delivery-phase traffic 
volume. In the authors consider the dynamic content 
set and design the cache updating scheme, termed as coded 
LRS, to approximate the lower bound. Following this line of 
research, in this paper, we consider the more general scenarios 
with heterogeneous cache sizes. We now briefly review the 
original coded caching scheme with an example. 

Example 1 (Decentralized Coded Caching Scheme 5) Sup¬ 
pose the system is distributing 2 contents A and B to 2 users, 
each with the cache size MF bits. The size of each content is 
also F bits. In the placement phase, each user randomly caches 
MF/ 2 bits of content A and B independently. Let us focus 
on content A. The operations of placement phase partition 
content A into four subcontents, A = (Aq, A±, A 2 , 
where U C {1,2}, and Ajj denotes the segments of content 
A that are prefetched in the memories of users in U. For 
example, A\ represents the segments of A only available in 
the memory of user 1. We use | ■ | to denote the expectation 
size of each segment, and \A@\ = (1 — M/2) 2 F bits, |Ai| = 
\A 2 \ = (M/ 2)(1 - M/2)F bits and |Ai, 2 | = {M/2) 2 F bits. 


The same analysis holds for content B. In the delivery phase, 
the worst case is that user 1 and user 2 requesting content 
A and B, respectively. User 1 has cached subcontent A\ and 
A 1.2 in the placement phase and lacks A@ and A ■>. Similarly, 
user 2 has already cached B 2 and B\ 2 , and lacks Bq and 
B\. With traditional uncoded caching scheme, the server is 
required to unicast A 0 and A 2 to user 1 and unicast Bq and 
B\ to user 2. The total traffic volume is 




F + 2 




2 

F = (2 - M)F. 


With the coded caching scheme, the server can satisfy the 
requests by transmitting A 0 , Bq and A 2 CD B\ over the shared 
link, where © denotes the bit-wise XOR operation. The traffic 
volume over the shared link is 





(2 - M)F. 


We can see that, the coded caching scheme has a coded 
gain of (l — in contrast to the uncoded caching scheme. 
The traffic volume for general case can be seen in [7]. 


B. Motivation 

In the regime of heterogenous cache set, an intuition is 
whether the decentralized coded caching scheme S’ can be 
applied straightforward. In fact, making the following simple 
modifications of scheme 5, we can adopt it in our regime. 

In the placement phase, user k randomly prefetches 
M^F/N bits of content n. 

Based on this caching strategy, the same content stored in 
different users’ local caches will occupy the different size of 
memory space. Thus, in the delivery phase, for each multicast 
user group, the size of segment, i.e., A 2 and B\, for each 
user will be different. Since these segments should participate 
bit-wise XOR, all the segments in one group should be bits- 
padded to the longest one. A possible padding method is to 
pad zero bits to the shorter segments and we refer this scheme 
as So. The following example will illustrate the performance 
of scheme So- 

Example 2 (Coded Caching with Heterogenous Cache Set) 
Suppose that there are three similar system distributing N 
contents to K = 2 users. The first system adopts our modified 
coded caching scheme So and the users have the cache set 
Ai = {(2 — a)M, (2 + a)M}F bits with 0 < a < 2. 
The second system divides users into two groups and adopts 
scheme So in eac h group. The users take the same cache set 
A 4. The third system adopts traditional coded caching scheme 
S, and have the uniform cache size Ai 1 = {2M, 2M}F bits. 
Remark that above three systems have the same aggregate 
cache size of AMF bits. 

Assume that user 1 and user 2 request content A and 
content B. Based on the coded delivery technique, the server 
transmitted signal A 2 (BB\, A g and /ig, and the corresponding 
signal size via first system is 


( 4 ) 






The corresponding signal size via second system is 

/ 4 M\ 

R il (M,N,2)= (2- — ) . (5) 

The signal size via third system is 



Then we consider two kinds of scenarios: the cache set of 
small deviation (a = 0.2) and the cache set of large deviation 
(a = 1.8) to refer the situation that the size of each user’s 
local cache is similar and extremely different, separately. The 
number of contents N = AM. 

Base on the traffic volume formula we get the 

following results, shown in TABLE [I] It can be seen that, 
when the size of each user’s local cache is similar, the traffic 
volume produced by system 1 approximates to the system 3 
and much lower than system 2; when the size of each user’s 
local cache is large, it approximates to the system 2 and much 
larger than the system 3. This result comes from a “bits waste” 
phenomenon that when the user cache size varies widely, the 
size of requested segments A 2 ,Bi are largely different, and 
the scheme $ 0 will pad lots of useless zero bits to the smaller 
segment B\, which will diminish part of coding opportunities 
for the longer segment A 2 and thus increase the traffic volume. 


TABLE I 

The coded data and traffic volume incurred 


Scenario/Traffic (bits) 

system 1 

system 2 

system 3 

a = 0.2 

0.7975F 

IF 

0.75F 

a = 1.8 

0.9975F 

IF 

0.75F 


Example 2 shows that when the cache set is approximately 
uniform, the coded caching scheme still has a significant 
caching gain, while when the cache set is extremely different, 
such gain will diminish and the grouping of users (system 
2) will not reduce the performance of whole system. This 
phenomenon and the reason behind it inspires us in three 
dimensions. First, whether the scheme go is still order optimal 
or whether can we develop some new techniques to overcome 
the miss of coding opportunities when the users’ cache sizes 
are extremely different. Second, in the derivation of traffic 
formula 0> there exists amount of maximization operation 
in traffic volume-memory trade-off and the number of such 
operation will increase exponentially with number of users 
I\ increasing. The key is whether we can get a close-form 
expression of such trade-off. Third, what condition the cache 
set should satisfy such that coded caching scheme will have a 
similar performance of the GCD scheme. The following work 
answers these questions. 


A. Information-theoretical lower bound 

The information-theoretical lower bound is independent of 
any specific schemes, instead, only dependent on the system 
parameters including Ai, N and K. The following theorem 
gives this lower bound on the optimal achievable traffic volume 
R* (Ml, N, K) based on the Fano’s inequality and cut set- 
bound argument {20). 

Theorem 1: (Cut-set Bound) For caching problem with 
N contents, K users, and ordered cache set M. = 
{Mi, M 2 , ..., Mx}, we have 


R*{M,N, K) > R C (M,N, K) 


A 

= max 

sG/C 


< max 

(E/C[AT],|£/|=s 



[N/s\ jj 


(7) 


Proof: We provide a proof sketch of Theorem |T] We 
regard each user’s cache and the content server as the nodes 
and the broadcasting links between the users and content 
server as the edges in the bipartite graph. Then, for a feasible 
(At, R) pair, the total information contained in the memory of 
any subset of caches and the server transmitted signals must be 
at least the size of contents that users accessing these caches 
can reconstruct. Using above concept and traverse all subsets 
of users. Theorem |T| follows. See (U for the details. ■ 
Insight on the lower bound: 

(1) In the traditional homogeneous case, the lower bound 

achieves the approximately maximum value of -= when 
s « which implies that the optimal scheme attains the 
delivery-phase traffic volume that is inverse proportional to the 
cache size. The traditional coded caching scheme also attains 
such inverse proportional trade-off and only exhibits constant 
gap less than 12. While in the regime of heterogenous cache 
set, above analysis will become more complicated. Consider 
the assumption that M\ < M 2 < • • • < Mk , the inner 
maximization operation in the lower bound is reduced to 
s — Mi/[N/s\. Then the cut set bound R C (AA,N,K) 

can be regarded as the function of s and it can be proved 
that R c (Ai, N, K) achieves the maximum when s satisfies 
X)i =1 M i + ( s - 1 )M S < N and M i + sM s+1 > N, 
which has a discrete form and is impossible to derive such a 
simple form as in the homogenous case. But we can use this 
condition to simplify such lower bound and make the order 
analysis in the sequel. 

(2) As can be seen in 0. when the users has the identical 
cache sizes, above lower bound will embody the bound derived 
in 0 and we have the following result. 

Corollary 1: (Relation between two kinds of cut-set bound) 
For caching problem with N contents, K users, we have 


R C (M,N,K) > R C (A\ I ,N,K), (8) 


IV. Fundamental Bound and Implementation 
Scheme 

In this section, we will derive the fundamental bound, design 
the corresponding scheme under the heterogenous cache sizes 
and prove its order optimality. 


equal if only if At = At 1 . 

Corollary [T] shows that when the aggregate cache size is 
fixed, if we regard the lower bound as the function of the 
cache set A 4, this function achieves the conditional minimum 
when cache set is homogenous. Namely, the cut-set bound 
will increase when the aggregate cache sizes is distributed 










in a nonuniform manner. This result can be regarded as a 
preliminary illustration for the degenerated performance under 
heterogenous cache set. The reason will be discussed in the 
last subsection and the further quantitative discussion will be 
presented in the Section fv] 

Previous work © G3 points out the cut-set bound is 
sometimes loose, and tighter bounds on R*(A4,N,K) can 
be derived via stronger arguments than the cut-set bound. 
However, the cut-set bound along is sufficient for illustration 
of the main idea of our work and we will use this bound in 
the following discussion. 

B. Coded Caching Scheme 

For clarity, we first present the main procedure of our mod¬ 
ified scheme mentioned before. The pseudocode is shown 
in Algorithm 1. Then we utilize the inherent characteristic of 
scheme $ 0 to prove the zero-padding solution is the optimal 
coded delivery scheme under the random caching procedure. 


Algorithm 1: Decentralized coded caching scheme with 
nonuniform cache size Xi. _ 

Placement Phase 

for (k = 0; k < K; k + +) do 
for (n = 0; n < N; n + +) do 

user k randomly prefetches M k F/N bits of 
content n ; 

Delivery Phase 

for (k = K,k > 0;k - ) do 

for choose k users from K users to form a subset U 

do 

Maxsize <r- ma,x keU \V kjU /{k}\', 

for l £ U do 

if |Vfc,i 7 /{fc} | < Maxsize then 

temp <— (Maxsize — \V k jj/{k}\) bits of 
all zero.; 

Vk,u/{k} v k,U/{k} + temp-, 

Xu <- Xu © V KU/{k} ; 

|_ Multicast the coded data Xu to users in U. 


Operator 0 refers to bitwise XOR operation. In the place¬ 
ment phase, the random caching procedure will divide content 

i into 2 k segments: W l: u, U C 2 K . In the delivery phase, the 
element V k ,u/{k} i n signal Xu represents the segment of user 
k’s requested content that being cached in users of set U / {k 
Since we take the zero-bits-padding for each segment before 
delivery, the size of signal Xjj is determined by the longest 
segment V ktU /{k}> k G U. 

Example 3 (K = 8, XI = { M 1 ,M 2 ,M 3 }) Consider the 
caching problem with N contents and K = 3 users with 
cache set {Mi, M 2 , M 3 }. Based on the Algorithm 1, each 
content k will be divided into 8 sub-contents such as A = 

3 If user k's request is d k , then 14 ,t//{fe} = 


{A 0 ,A 1 ,A 2) A 3 ,A 12 ,A 13 ,A 23 ,A 123 }, and the transmission 
signal and corresponding signal size are, 

• |Vi >a 3 ®V 2 ,130^3421 = |Vi l23 | = (1 - ^) (%) (f * 1 * 3 ); 

• |Vi >2 © V 24 I = 0 , 2 1 = (l - ^) {%) (1 - #); 

• 1^3,1 © v li3 \ ( = } ITX3.il = (1 - $■) (1 - f^) (^); 

• \V 2 ,s © 14,21 ( = } |W 2)3 | = (1 - $0 (1 - %) (f); 

• \V lt0 ,V 2t0 ,V 3 ,0\ = 3 (1 - Ml) (1 - (1 - §0 

The operation (a) is based on the zero-bit-padding technique. 
Assume the users request content A, B and C in the delivery 
phase. The segment 14,23 = 03, 14,13 = B 13 , and other 
segment has the same meaning. The following lemma shows 
an important characteristic of this longest segment. 

Lemma 1: (Ordered segment size) Based on Algorithm 1, 
the size of each transmission signal Xu is, 


\Xu\ = max |14,a/{fc}| = 114nf C7,C7/{inf C7} |- (9) 

k£U 

Lemma [T] shows that the longest segment of signal Xu 
is user inff/’s requested content 14 nf u,u/int t/> w ho has the 
minimum cache size of user set U. This result is intuitive, since 
the segment V k jj/k represents the logical unit of content W,t k 
that only stored in the caches of users in U/k and related to 
the the cache size of these users, if user k has the minimum 
cache size of all users in U, the users in U/k will has the 
largest memory space to store this logical unit and yield the 
largest size of it. The strict proof can be seen in HD- 
Based on Lemma [T] we have the following theorem, 
Theorem 2: (Optimality of the coded delivery scheme) The 
zero-padding scheme in delivery phase is the optimal scheme 
under the placement phase scheme of Algorithm 1. 

Proof: Here is a proof sketch of Theorem [2] which is 
based on the concept of the index coding m- As shown 
before, the placement phase scheme of Algorithm 1 will 
produce N ■ 2 K segments, and during the delivery phase, the 
users will request different contents in the worst case. We 
regard each segment as the node and there exists an arc from 
node i to node j if only if content segment i has been stored 
in the user that requests content j. Besides, there also exists 
an arc between node V k .u and V k ,u' if only if U C U' or 
U' C U, because these two kinds of segment can be regarded 
in the same node in some cliques. Then, the optimal scheme is 
to find the maximal set covet of such graph according to the 
index coding theory, based on which we design the optimal 
coding scheme. Finally we prove that the delivery scheme in 
Algorithm 1 will incur identical traffic volume compared to 
the optimal coding scheme. Theorem [2] follows. See | |T9) for 
the details. ■ 

Theorem [2] shows the optimality of coding scheme of 
Algorithm 1 with respect to the given caching procedure. Here 
is an illustrative example. 

Example 4 (Optimality of the coded delivery scheme) 
Suppose the system is distributing N = 3 contents A, B, and 


4 Here the set cover and cliques mentioned below are the concepts of the 
index coding and have the same meaning in the graph theory, further details 
can be seen in (l6j 













C to I\ = 3 users with Mi < M 2 < M 3 . In the worst-case, 
the users will request content A, B and C. Then we construct 
the graph based on above proof procedure as illustrated in the 
Fig. Under the coded delivery scheme in Algorithm 1, it only 
encodes the following four cliques: (A 23 , B 13 , C 12 ), (Ai, B 2 ), 
(A 3 , Ci) and (i? 3 , C 2 ). However, there exists other larger 
cliques such as (B 13 , A 2 , Bi) and (A 23 , A 3 ,Ci) as shown in 
the red and blue line in the Fig [2] Here we draw a dotted line 
between node f ? 13 and B \, since they can be regarded as the 
same node in the clique (-B 13 , .4 2 , li \). We can find that larger 
cliques always occur among the different type segments, i.e., 
Ajj and A,. Thus, we can design the following optimal coding 
scheme that always pads the useful bits subtracting from A t j 
to A,, . Consider the bit-wise operation between A 2 and B t , 
since the length of B\ is shorter than ,4 2 , the optimal coding 
scheme will pad the useful bits subtracting from segment B 13 
to B 1 , then the length of S 13 will be reduced. Based on this 
procedure, we have the results shown in TABLE [H] 

TABLE II 

The coded data of scheme So and optimal scheme 


Index 

Scheme 3 o 

Optimal system 

1 

a 2 e Si 

A2 © = B\ + ^13} 

2 

Me Ci 

a 3 © {c; = Ci + c; ? } 

3 

S3 © C2 

S 3 © {C^ = C 2 + C' 13 \ 

4 

A23 © S13 © C12 

A23 © {S13 - s; 3 } © {C12 - c' 12 } 

5 

A 0 ,B 0 ,C 0 

A 0 ,B a ,C 0 


It can be seen that, in the 4th transmission, the length of 
B 13 and C 12 have been reduced due to the useful padding of 
previous transmission. However, the traffic volume produced 
in 4th transmission is not reduced because the length of the 
longest segment A 23 is not reduced. Thus, the scheme 3o 
produces the identical traffic volume of the optimal scheme. 
The strict proof and details can be seen in. 

Lemma 2: (Traffic-memory trade-off) For caching prob¬ 
lem with N contents, K users each with ordered cache set 
Ai = {Mi, M 2 ,..., Mjf}, the scheme go produced the traffic 
volume of 


K s 


R do (M,N,K) = Y J Y, 


S — l 2=1 


n Mj 


Vi=*+i 


N 


nil M ‘ 


3 =1 


E u lN ~ Mjk 




Mi, 


N 


( 10 ) 


where the (ir)l = 1 if x = 0 ; else (aip = x. 

Lemma [2] presents a possible formula to calculate the traffic 
volume under ordered cache set A4, which is based on the 
natural summarization of the delivery scheme in Algorithm 


1. The summarization procedure is tedious and details can be 
seen in UD- However, the trade-off formula in Lemma [2] is 
difficult to calculate due to the exponential number of terms in 
the summation. And we cannot get any useful insight, i.e., how 
the user cache sizes influence the traffic volume, from such 
trade-off. Fortunately, this formula shows an useful iterative 
structure in aspect of number of users K that we can utilize 
to get a close-form expression of the traffic volume under 
scheme $ 0 and show some basic insights. 

Theorem 3: (Close-form expression of the traffic-memory 
trade-off) For caching problem with N contents, K users, with 
ordered cache set Ai = {Mi, M 2 ,..., Mr}, the scheme $ 0 
produces the traffic volume of. 


K 


R Sr: (Ai.N,K) = ^ 


i—1 


n 

i=i 


1 - 


Ml 

N 


( 11 ) 


Proof: Here is a proof sketch of Theorem [3] We use the 
mathematical induction method to simplify and induction basis 
is the number of users. First, the traffic volume under k = 1 
users is 1 — M\/N\ then we assume the traffic volume under 


k = K — 1 users is XEi 1 


IE-i (1 - #) 


and prove the 

traffic volume has the form of when k = K based on 
the trade-off |2]i and assumption when k = K — 1, Theorem[3] 


follows. See 


for the details. 


The proof of Theorem [3] and the traffic volume formula 


(111 provide us with an observation that, in the aspects of .sth 
transmission, the traffic volume due to the introduction of user 
K can be seen as the average combination of (s — l)th and sth 
transmissions when there are (K — 1) users. The main reason 
is that the random caching in the placement phase and coded 
multicasting procedure in the delivery phase “connects” the 
users’ local caches together, such that the requested sent from 
a new user can be not only satisfied by its own cache but also 
by other users’ caches. 


C. Discussion 

We use the close-form trade-off to derive the following 
equivalent network model and scheme. 



Fig. 3. Equivalent network diagram of coded caching. 

As shown in the right part of Fig. [3j the equivalent network 
diagram consists of a content server connecting to K users 
through an error-free unicasting link. Each user can utilize the 
contents that being stored in the users having smaller cache 
size, and communicate with them by an unicasting link. The 
caching procedure is same as scheme and the delivery phase 











































uses the hierarchical unicasting transmission. When user k 
sends its request dk , it first uses its local cache to reconstruct 
part of its requested content W,i k , then it successively uses 
user (k — 1) to l’s local cache to reconstruct W,i k ■ finally the 
server unicasts the part of content Wd k that are not stored in 
user 1 to fc’s local cache to user k. Thus, the traffic volume 
incurred in the unicast link between server and user k is 


K 

n 


Mi 

1 -- 

N 


Summming the traffic volume produced by all users, we can 
get the traffic formula that is identical to ( fTT) . 

From this equivalent network diagram, we can see more 
clearly how the coded multicast plays a role in “connecting” 
users local cache. For the traditional uncoded cache, each user 
only uses its local cache, while for the scheme Fo, each user 
can access other users cache that has a smaller cache size. 

Corollary 2: (Traffic volume under uniform cache set) 
When the size of each user’s local cache all equals to M, 
and the corresponding cache set is A4 U = {M ,..., A/ }. The 
number of users is K, we have. 


Rz 0 (M u ,N,K) = K 



N 

KM 



This result has been exhibited in previous work [7|, which 
can be regarded as the special case in our regime that the 
users’ cache sizes are homogeneous. 

Corollary 3: (Traffic volume under singularity cache size) 
When the the cache set is A4 S = {M, M ,..., (1 +a)M}, a > 
0, and the number of users is I \, we have. 


M ( M\ 

R- 3 o(M s ,N,K) = R(M U ,N,K) — a— (l - — J 

Corollary [2] shows that, the increase of aggregate cache size 
KM will reduce the traffic volume in an inverse proportional 
manner, as show in the term N/KM. However, Corollary [3] 
shows that the increase of aggregate cache size ( I\ + a)M, 
i.e., the increase of parameter a, will reduce the traffic in a 
linear manner. Thus, the effect of the heterogeneous cache set 
on the traffic volume has a different form compared with the 
homogeneous cases. 

Corollary 4: (Relation between two kinds of traffic- 
volume) For caching problem with N contents, I\ users with 
ordered cache set Ad = {Mi, M 2 ,..., M/<-}, we have. 


R^ o {M,N,K)>R^ o {M i ,N,K), (12) 


equal if only if M k = E/ c [Mfc], V/c £ K. 

Proof: Here is a proof sketch of Corollary [4] We regard 
this problem as the conditional minimum of R$ o (Ai, N, K) 
with respect to the summation of Mi is constant. We first 
prove the following operation can strictly reduce the value of 
function: M' = (Mi + Mj+i)/2 and M' +1 = (Mj+Mj + 1)/2. 
Then, we use the contradictory induction to prove above 
function attains the minimum when each M„; is equal. See [191 
for more details. ■ 


The Corollary [4] is easy to understand for the scheme 3 0 , 
since the there is no miss coding phenomenon when the size 
of each user’s local cache is identical, the traffic volume will 
be minimized. 


D. Order Optimality Analysis 

Based on above close-form expression of the traffic volume- 
memory trade-off and the cut-set bound, we have 

Theorem 4: (Order optimality of scheme go) For caching 
problem with N contents, K (K < N ) users with heteroge¬ 
nous cache set Ad = {Mi, Mi, ..., Mk}, we have. 


R do (M,N,K) 
R*(Ad,N,K) ~ 


(13) 


Proof: The proof of Theorem [4] is extremely complicated 
and we provide a sketch of it. We divide this proof into 
following 15 cases based on the relation among N, p (p is 
defined in the first insight of lower bound) and 12, and prove 
that, in each case, such gap is less than 12. See for more 
details. ■ 

It can be seen that the scheme g Q exhibits the constant gap 
12 to the cut-set bound. In fact, we will show in the numerical 
analysis that this gap is less than 4. As analyzed before, since 
the miss of coding opportunities in this regime will lead to 
the increase of the traffic volume, the only reason for such 
constant gap is that the increase of cut-set bound, i.e., a more 
increasing speed due to the heterogenous cache set. Here we 
discuss the main reason behind this trend. The cut-set bound 
argument presents an essence that the broadcasting signal of 
caching network plays a role in combining users’ caches to 
joint reconstruct their requested contents, and for a user group 
U of s users, the minimum size of broadcasting signal will 
always be bounded by those s minimum caches. If the cache 
sizes are distributed in a nonuniform manner, the gain coming 
from such combining information retrieval will be weaken, 
namely, the broadcasting signal is limited in this regime. Thus, 
although coding scheme will miss the coding opportunities, 
it still guarantee the order optimality. 

Moreover, this constant gap is related to the deviation of 
the cache set. Consider two extreme cases: in the first case, 
the cache set has the smallest deviation that the all users’ 
cache size is identical, then the gap has been proved to 12; 
in the second case, the cache set has the largest deviation 
that the cache size of half users is 0 while anther half is N, 
then the traffic volume and the corresponding information- 
theoretical lower bound is all K/2, the constant gap reduced 
to 1. This result implies that the lower bound might have a 
more increasing speed than the coded caching scheme with 
the deviation of the cache set increasing and the coded mulit- 
casting efficiency in such network will gradually degenerated 
to the uncoded manner. Then it is natural to ask how such 
gap relates to the deviation of the cache sizes. In the next 
Section, we will use the concept of probabilistic cache set to 
quantitatively investigate the relationship between this constant 
gap and the characteristics of the cache set. 







V. Theoretical Analysis 


In this section, we assume the cache size of each user is a 
random variable and the corresponding cache set is referred to 
probabilistic cache set. We derive the expected gap between 
the traffic volume produced by scheme 3o and the cut-set 
bound under the specific cache size distribution. Besides that, 
we preliminarily investigate the grouping effect of the coded 
caching scheme under heterogeneous cache size. 


A. Order Optimality under Probabilistic Cache Set 

Let Mk = {Mi k, M2.k, ■ ■ • j Mk.k} denote the cache 
set, where Mk is a set of order statistics that comes from 
a common parental distribution F. Here, the order statistics 
represent the a series of independent random variables such 
that Mi k < M 2 .K < ••• < Mk.k- Further detail can be 
seen in ED- 

Correspondingly, the order optimality is defined by 

Definition 5: (Expected order optimal) The scheme 3 is 
order optimal if only if 


Rz(M K ,N,K) 

R*(M k ,N,K) 


< (7(F). 


(14) 


We can see that this constant is function of the parental 
distribution F. In the following discussion, we will show that 
this constant is a function of 1th moment and 2th moment of 
F when K = 2. 

Theorem 5: (Order optimality for normal distributed cache 
size) For the caching problem, N(N > 2) contents, K = 2 
users each with cache size M 2 = {Mi, 2 , M 2 . 2 }, and comes 
from a normal distribution Af(p, a 2 ), then 


% o (M 2 ,V,2) 
R c (M 2 ,N, 2) 


^ o V*P + a 


(15) 


From Theorem [5] we can clearly see how the constant gap 
related to the variance of the cache set, where the variance 
of the cache set plays a linear negative role in such constant. 
Similarly, for the uncoded caching scheme, such gap is less 
than 

2 - — 


which has a faster speed in the uncoded cache, seen 2cr rather 
than the a of coded version. Consider the traffic volume 
produce by the uncoded caching scheme 3u is = 2 — ^ 
and irrelevant to the variance of cache set. Thus the descending 
gap of uncoded version mainly come from the increment 
of the cut-set bound, which further implies the deviation of 
cache set will strengthen the inherent degenerated performance 
of wireless coded mulitcasting. Remark that the variance a 
cannot be infinity since the range of Mk is [0, N ]. 


B. Grouping Effect of Heterogenous Cache 

When the users’ cache sizes are extremely different, the 
coded caching scheme will approximate to the GCD. This is 
illustrated in the following example. 

Example 5 (Group coded delivery) Suppose a system is 
distributing N contents to K users (here K is an even number) 


with the cache set in first scenario is A id = {A4i,M 2 }, 
where M\ = {M,... ,M}, M 2 = {aM .. ,aM},a > 0, 
and \Mi\ = \M 2 \ = {{. Under the scheme 3o, we have 

RtoiMuN, |) + Qk ■ RzAM 2 ,N, §), 
where Qk = (1 — M/1V) T . Since aM < N, we have Qk > 

2 k 2 , 

(1 — 1 /a)" 5 ". Then the cache set Md has a large deviation 
corresponds to the large value of parameter a and that Qk 
approximates to 1. Thus, the traffic formula R$ o (Md, N, K) 
will approximate to the traffic produced by two groups. 

As for the general case that has L groups, we use the 
concept of probabilistic cache set to develop an upper bound 
of the increment of traffic volume due to the GCD and show 
that this upper bound is related to the deviation of the cache 
set, i.e., the larger deviation, the smaller gap between coded 
cache and GCD. 

Theorem 6: (Expected upper bound of GCD) For caching 
problem with N contents, K (K < N ) users with proba¬ 
bilistic cache set Mk, we divide all users into L groups 
K. = {K 1; ..., K l }, M k = {Mk,...,Mk}, then we have. 


%,(Mk,JV,|K|) 


< 


-L r+° 


(16) 


It can be seen that this bound is dependent on the expec¬ 
tation and variance of distribution F and shows a trend that 
the increment of the traffic volume will be reduced when the 
variance increases. This result means that the heterogenous 
cache set will diminish the effect of the GCD. The reason that 
we get this kind of form of this upper bound can be seen in 
the full version of this paper 1191. The Fig. [4]partly shows the 
tightness and effectiveness of this bound. 
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Fig. 4. The comparison between approximate upper bound ( | 1 6[ > and real 
value of increment of traffic volume under four kinds of deviation of cache 
set. The system parameters N = 500, K = 300 and /x = 200. 


As show in Fig. [4] when the variance is extremely small, 
our approximate bound is tight, especially when the number 
of groups is small, while when the variance is large, this 
approximate bound will be a little loose. Besides that, another 
important observation is that he increment of traffic volume 
is factually convergent to a constant. In particular, when the 
variance a = p, this increment is bounded by only a constant 
factor 2. 

Moreover, the grouping effect of the heterogenous cache 
size provides us with an insight that, in the practical scenario, 
the coded caching scheme should be operated in the grouping 
manner. As shown in Fig. |4[b), the variance cr = 0.5/ti to 
denote the most practical scenario. If we divide the 300 users 
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into 15 groups and each group has 20 users, the complexity 
will be reduced from 2 3 00 ss 10 90 to 15 • 2 20 ss 10 7 , while 
the increment of traffic volume is just two times. 

VI. Numerical Results 

In the previous section, all analysis is in the scenario that 
I\ < TV. In this section, we present the numerical results 
when K > TV. Then, we investigate the impact of the system 
parameters on the delivery-phase traffic volume. Moreover, we 
systematically investigate the performance of GCD. 



(a)N=100,K=50 


{b)N=50,K=100 


Fig. 6. The impact of deviation of cache set on the gap. 


A. The Effect of Cache Set 

We have proven that when K = 2, the constant gap is 
related to the variance of the cache set. For large K , we use 
the numerical analysis to show how the constant gap scales as 
the deviation of the cache set. To guarantee the deviation of 
the cache set, the average cache size cannot be too large. In 
fact, this constraint is reasonable, since the number of contents 
is mostly much larger than the cache size. Thus, we set the 
maximum average cache size // = 0.31V, and the cache set 
comes from a normal parental distribution. 




(a) N=100, K=50 (b) N=50, K=100 

Fig. 5. The impact of average cache size on the gap between i?g (Ad, N, K ) 
and R C (JA, N, K). Note that the scale in the (a) is a semi-logarithmic 
coordinate system. 

The gap in Fig. [5] and Fig. [6] refers to the constant gap 
between traffic volume produced by scheme $ 0 and cut-set 
bound. Fig. [5] plots the gap versus the average cache size // 
under fixed variance of cache set. The variance under small 
and large deviation are denoted by a = 0.1 p and a = 0.7 p, 
separately. The zero deviation refers to the traditional case that 
the cache size is uniform and we regard it as a baseline. In 
the Fig. [5ja), the number of contents TV — 100 is larger than 
the number of users K = 50. The gap under large deviation 
is strictly less than that under small deviation, and less than 
the baseline. While in the Fig. |5jb) that TV = 50, K - 100, 
this relation shows piecewise characteristics: when the average 
cache size is small ( p < 0.25 AT), it has the same manner as 
TV > K, when the average cache size is large ( p > 0.257V), 
the gap under homogenous cache set decreases dramatically 
and less than the other two cases. 

Fig. [6] plots the gap versus the variance of the cache set 
under fixed average cache size p = 30. It can be seen that, the 
performance gain of coded cache compared with the uncoded 
cache is 15x under small deviation (er = 0.05/z), while only 
approximately 3x under large deviation (a = 0.6//). For the 
case that TV > K, seen in Fig[6|a), both gap-variance curves of 


coded and uncoded cache have a declining trend and uncoded 
version performs faster. While for the case that K < TV, seen 
in Fig [6fb), the gap-variance curve of coded cache shows an 
unique single valley manner: under small cache sizes, it first 
increases, then it decreases linearly when the variance is large. 

B. The Impact of number of contents and users: TV and K 

Then we present how the gap scales as the system param¬ 
eters such as TV and K. Assume there are two relationships 
between K and TV: TV = 0(AT) and TV = uj(K). The first 
case refers to the number of users and the number of contents 
have the same order. The second case refers to the number 
of content is extremely larger that the number of users. The 
characteristic of cache set is p = 30, cr = 0.3 p. 




(a) N=8(K) (b) N=o>(K) 

Fig. 7. The impact of system parameters N and K on the gap between 
R$ o (.M, TV, K) and R C (A4 , N,K). The average cache size is yu = 30. 


In Fig. |7Ja), we can see that the gap gradually increases to 
a constant when TV is sufficiently large, while in Fig. [7}b), 
the gap gradually decreases to 1. Based on the asymptotic 
analysis, we can get easily get the reason for TV = uj(K). 
Since TV — > oo, each term in (111 will approximate to 1 and 
R do (Ai, TV, K) will approximate to K. In the similar manner, 
the cut-set bound TV, K) will also approximate to K. 

Thus the gap will approximate to 1. The reason for TV = 0(A') 
is based on an extremely complicated series analysis and can 
be seen in the technical report ca¬ 


vil. Conclusion and Open Problems 

In this paper, we have investigated the fundamental limits 
of coded cache under the heterogenous cache set. Through 
deriving the fundamental bound in this regime, we have 
pointed out that coded caching scheme with zero-bits-padding 
can still guarantee the order optimality. Moreover, using the 
concept of probabilistic cache set, we have proven such gap is 
closely related to the deviation of cache set. And above results 


















































show the inherent degenerated performance of the bottleneck 
network under the heterogenous cache sizes. Besides that, we 
also show the grouping effect of the coded caching scheme and 
this result provide an insight with us that the coded caching 
scheme under the heterogenous setting should be operated in 
the grouping manner. 

There are several important open problems induced by the 
heterogenous cache framework: 

• In this work, we point out the necessity of GCD under 
heterogenous cache. What is the optimal grouping plan 
if the number of group is given? 

• The content popularity and heterogenous cache size will 
both cause the miss of coding opportunities. How is the 
performance when we consider them simultaneous? 

• This work focuses on the worst-case analysis, how is the 
performance if we study this problem in the aspect of the 
average performance? 
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